07:48
2026-06-27
the-decoder.com
large-language-models
ByteDance's "iLLaDA" is a diffusion language model that keeps up with Qwen2.5
ByteDance and Renmin University researchers released iLLaDA, an 8B diffusion language model that matches Qwen2.5 on base benchmarks but lags after fine-tuning. The model, trained from scratch on 12 trโฆ